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Approximate ML Decision Feedback Block 
Equalizer for Doubly Selective Fading 

Channels 

Lingyang Song, Rodrigo C. de Lamare, Are Hj0rungnes, and Alister G. Bun- 
Abstract 

In order to effetively suppress intersymbol interference (ISI) at low complexity, we propose in this 
paper an approximate maximum likelihood (ML) decision feedback block equalizer (A-ML-DFBE) 
for doubly selective (frequency-selective, time-selective) fading channels. The proposed equaUzer 
design makes efficient use of the special time-domain representation of the multipath channels through 
a matched filter, a sliding window, a Gaussian approximation, and a decision feedback. The A-ML- 
DFBE has the following features: 1) It achieves performance close to maximum likelihood sequence 
estimation (MLSE), and significantly outperforms the minimum mean square error (MMSE) based 
detectors; 2) It has substantially lower complexity than the conventional equalizers; 3) It easily 
realizes the complexity and performance tradeoff by adjusting the length of the sliding window; 4) It 
has a simple and fixed-length feedback filter The symbol error rate (SER) is derived to characterize 
the behaviour of the A-ML-DFBE, and it can also be used to find the key parameters of the proposed 
equalizer. In addition, we further prove that the A-ML-DFBE obtains full multipath diversity. 

Index Terms 

Doubly selective fading channels, equalization, matched filter, linear MMSE, MMSE-DFE, max- 
imum UkeUhood sequence estimation. 

L INTRODUCTION 

Wireless communications often suffer from severe inter-symbol interference (ISI) due to doubly 
selective fading. In order to suppress the channel distortion, channel equalization techniques are 
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essential, and indeed have received considerable attention for many years. Maximum a priori proba- 
bility (MAP) equalization is the optimum equalization procedure in terms of minimum symbol error 
rate (SER) m, but requires a prohibitive computational complexity for many applications, being 
exponential in the channel length and constellation size. Maximum likelihood sequence estimation 
(MLSE) can obtain SER performance very close to MAP, but its complexity is still extremely high |2|. 
As a result, many sub-optimal, low-complexity equalization techniques have been proposed, such 
as the popular minimum mean square error decision-feedback equalizer, which is very effective in 
certain multipath environments and has a complexity that is only dependent on forward and backward 
filter lengths ISj. However, there is a non-negligible performance loss of MMSE based equalizers in 
comparison to MLSE gl Q. 

Further still, while lots of research have been conducted on the time-domain equalization, few 
works take the special form of the channel representation into good account. Two properties of the 
channel matrix in time domain are effectively utilized in this paper: 1) The Toeplitz-like channel 
matrix significantly contributes to the equalizer design; 2) The large number of zero elements reduces 
the computational complexity. As a result, we propose a robust approximate ML based decision 
feedback block equalizer (A-ML-DFBE) to combat ISI over doubly selective fading channels with 
low computational complexity. The proposed equalizer exploits substantial benefit from the special 
time domain representation of the multipath channels by using a matched filter, a sliding window, a 
Gaussian approximation, and a decision feedback. The main ideas are firstly to subtract the effect of 
the already-detected signals obtained from past decisions. This can be treated as a decision feedback 
process. Secondly we apply Gaussian approximation |0-|l9l to realize near maximum likelihood 
detection. The accuracy of this procedure can be improved by adjusting the length of the sliding 
window due to the central limit theorem. Consequently, a complexity and performance trade-off can 
be realized, and a convergence in SER performance can also be obtained by adjusting the length of 
the sliding window. 

Note that IS and Q can be used only for frequency fiat fading channels, and HI aims to recover 
signals for multiuser systems. Although in ||9l a probabilistic data association (PDA) based equalizer 
is reported, there are several major differences compared to the proposed approach: In |9|, it requires 
to update the mean and the variance for all detected symbols; many iterations have to be used in order 
to make the performance converge; there is no feedback process; and no matched filter is employed. 
In ifTOl . bidirectional arbitrated decision-feedback equalization (BAD) algorithm was presented which 
has complexity at least two times of the MMSE-DFE but can achieve better performance. In [11], 
a class of block DEE is presented for frequency domain equalization, but it assumes that the length 
of the channel, forward filter, and backward filter are infinitely long which is not practical. Besides, 
it requires large number of iterations to make the performance converge, which increases the system 
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delay and the computational complexity. 

The rest of the paper is organized as follows: In Section II, we present the channel and signal 
models. The proposed A-ML-DFBE scheme and complexity comparisons are discussed in Section 
III. The performance is analyzed in Section IV. Simulation results are presented in Section V. In 
Section VI, we draw the main conclusions. The proof is given in the appendix. 

Notation: Boldface upper-case letters denote matrices, boldface lower-case letters denote vectors, 
C*^-' and denote the set of ixj complex and real matrices, respectively, (•)-^ stands for transpose, 
(•)* denotes complex conjugate, (•)^ represents conjugate transpose, Ij stands for an i x i identity 



matrix, E is used for expectation, var is used for variance, and ||x| 



II. Channel and Signal Models 
The doubly selective fading channel can be modeled using a finite impulse response (FIR) filter 

L-l 

H{z,t) = Y,hk{t)z~\ (1) 

A;=0 

where H{z,t) denotes the z transformation at time t, hi{t) represents the i-th path's channel coeffi- 
cient, and the length of the FIR filter is L. For simplicity, we only consider a single input and single 
output system. The received signals can be written in vector form as (for convenience, we drop the 
time index for each transmission frame) 



r = Hs + n, 



(2) 



where the received signals r = [ri, . . ., rAr+i„i]^, is the length of s, transmitted signals s = 
[si, . . .^snV, and n = [ni, . . ., whose elements are independent samples of a zero-mean 
complex Gaussian random variable with variance a"^ = E[|sfcp]/SNR, in which E[|sfcp] represents 
the average power of the transmitted symbols from constellation A. In this paper, we set E[|sfcp] = 1. 
The time domain representation of the doubly selective fading channel H G (^iN+L-i)xN ^ ^^j^ 
written as 

hi{0) 
/J2(0) hi{l) 

■ h2{l) hi(2) 
hL{0) h2{2) ... hi{N-l) 

hL{l) ': ... h2iN-l) 



H 











... hL{N - 1) 
Note that H has a structure similar to Toeplitz form, and some form of guard interval is necessary to 
avoid inter-block interference between the received signals |i5j|. The symbols in Q can be recovered 
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by MLSE |[T1. Alternatively, they can also be decoded in complex form using standard zero forcing 
(ZF) or MMSE approaches, linear or decision feedback equalization 131 . 

III. Description of the Proposed Method 

A. Approximate Maximum Likelihood Decision Feedback Block Equalizer (A-ML-DFBE) 

The proposed equalization algorithm can be summarized into three steps: 1) Forward process, which 
builds up the forward filter by a temporal sub matched filter; 2) Decision feedback process, which 
cancels the interference by a fixed length backward filter, and 3) Approximate ML process, which 
realizes the final signal detection by the aid of Gaussian approximation. The detailed description of 
each step is given below. 

1) Forward Process: Supposing we start decoding Sk, a temporal sub matched filter (forward filter) 
is applied to Q 

Hf r = Hf Hs + Hf n, (3) 

where denotes the matrix of size N x Lf, which is made of the entries in H, from the ^-th column 
to the {k + Lf — l)-th column and from the 1-st row to the A^-th. Lf (L < Lf < N) is the length 
of the sliding window that must be equal or larger than L for smaller inter-symbol interference and 
larger diversity gain, and smaller than or equal to A^. When Lf = N, the matched filter becomes 
H^. For simplicity, we may rewrite ([3]l as 

yfc=Js + nfc, (4) 

where = Hfr G C^f'<\ J = H G C^/^^, and = Hj^n G C^f'K We call this process 
horizonal slicing, since it takes L f rows of H. J is given by 

hfhi • • • 

: ■■■ : , (5) 

where hj G £^{n+l-i)xi (je^otes the i-th column of matrix H. The length of the forward filter has 
been defined as Lj in Q. 

2 ) Decision Feedback Process: The function of this step is to suppress the effects of the detected 
terms. 

In order to further decrease the complexity of (|4]), we can just consider a certain number of the 
transmitted symbols, and have 

Yfc ~ JfcSfc + nfc, (6) 

where Jfc G C^^^^*^"^^'"^-' can be constructed by taking the first column to the k + Lf— 1-th column of 
J in ([5]), and = [^i, . . . , We call this process as vertical slicing, since it takes k + Lf — 1 
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columns of J. Moreover, ^ can be decomposed with respect to each transmitted symbol 

k+Lf~l 

yk ~ j/^' + Sfc, (7) 

i=l 

where j, € C^'^^ stands for the i-th column of the matrix J^, and Sj represents the i-th transmitted 
symbol. Note that (|7]) is equivalent to dU) when the vertical slicing includes all the symbols in J, 
Lf = N+l — k, which implies the length of Lj will have some effect on the system performance, and 
the effect of Lf will be discussed further in the performance analysis and simulation results sections. 
We can write ^ as 

k-1 k+Lf-l 

yk - "^ii^i +jk^k + Yl j''^' + 

i=l i=k+l 

where Yl'iZih^i stands for the detected terms that can be rebuilt by the past decisions, jj^Sk is the 
current target, and Z]f=^i^ J/'^i represents the undetected terms. The function of the feedback process 
is to reconstruct X]/=i J/'^' ^^^^^ interference cancellation. Therefore, it is important to decide the 
length of the backward filter, Lf,. Based on the expressions of H and Q, we have ji = j2 = • • • = 
}k-L-2 = 0, and thus, the length of the backward filter Lb can be fixed at L — 1, (L > 1) to reconstruct 
the effects of past decisions. dSjl can be rewritten by simplifying the detected terms 

k-1 k+Lf-l 

yk- Y iiSi+yk+ Yl j'-'^' + ^fc' (9) 

i=k-Li, i=k+l 

where Lt, equals L — 1. As in Yl'i=k-Li,ii^i '■^^^ reconstructed from past decisions, the following 
past decision cancellation process can be applied 

k-1 
i=k-Li 

The above process is very similar to the decision feedback cancellation process, but unlike MMSE- 
DFE, we do not need to calculate the coefficients of the feedback filter, moreover, the length of Lf, 
is fixed at L — 1, which means that only L — 1 past decisions need to be fed back, which is much 
less than what is typically required by MMSE-DFE. 

3) Approximate ML: This step aims to achieve near optimal detection by applying the Gaussian 
approximation. We substitute ^ into (ITOl ) and get 

k+Lf-l 

yk=ikSk+ Y j/'^' + ^fc- (11) 

i=k+l 

In order to decode s^ with low computational complexity while maintaining the performance compa- 
rable to the ML decoder, we treat the undetected terms Yl'i=k+i^ h^i noise vector n^^k+Lf-i 
together as a new complex-valued Gaussian vector with matching mean and covariance matrix, such 
that (fTTI) can be expressed as 

yk = ikSk + Vk, (12) 
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where r/^ represents a vector with size x 1 of zero-mean complex- valued Gaussian random variables 
with covariance 

A^: = Jfc+lJfc+i + CJ^Jfc, (13) 

where J^+i can be constructed by using the {k + l)-th column as the {k + Lf — l)-th column of J^. 
and can be obtained by taking the A;-th column to the {k + Lf— l)-th column of J^,. According to 
the central limit theorem, the accuracy of the Gaussian assumption can be improved by increasing 
the length of the forward filter (sliding window), Lf. 

As r}j^ has an approximate Gaussian distribution, the likelihood function p{yk\sk) is given by 

P(yfck-t) oc exp (-(yfc -yk)"A^^ (y^ - j^^^)) . (14) 

Finally, Sk can be recovered by the following ML detector 

Sk = arg min ((y^ - yk)"A];^{yi, - jkSk)) , (15) 

at k = N — Lf + 1, there are no more new received signals outside the sUding window. So, we can 
then simply decode each undetected symbol by treating the rest as Gaussian term and removing the 
effects of the detected symbols. This decoding process is very similar to the case of k < N — Lf+ 1 
by fixing the sliding window. The overall A-ML-DFBE algorithm is summarized in Table Jl 



B. Computational Complexity Analysis 

Before we show the complexity comparisons, we present how to further reduce the proposed 
equalizer complexity. Note that, in (|9l), the detected terms Yl^iZk-Lth^i t)e rewritten as J^-iSk-Li 
with size Lf x 1, where Sfe_i = [sk^Lt, ■ ■ ■ , Sk-iV has size Lj, x 1. With respect to the diagonal 
element h^hg in J, when g > L, v/e can find that 



hfh^ = 0,i>g + L, 



hfhg = 0,i<g-L, 



(16) 



and thus, J^-i has the following form 



^k+l^k-Lt + l 








k^^k-l 



hf^L. 
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which has size Lf x Lf,. We can observe that there are only Xli^i ^ — ^~^j^^b non-zero elements in 
Jfc_i so that the reconstruction of the detected terms ^iZk-Ltii^i can be further simplified. Similarly, 
in (fTSl ). the calculation of and j^, can be simplified as well. 

Now, we discuss the complexity of the A-ML-DFBE, linear-MMSE [4], MMSE-DFE and BAD 
lITOl detectors in terms of the number of additions and multiplications. The resulting values are given 
in Table HIl obtained by inspection of the relevant algorithms in Table H H, and ifTOl . Details of the 
computation of complexity, for example the matrix inversion, can be found in lITll . The computational 
complexity of the A-ML-DFBE algorithm is a function of the frame length (N), the impulse response 
length (L), and the length of the forward filter (Lf), which is obtained on the basis of Table. HI From 
the table, we observe that A-ML-DFBE has the same order of complexity as the linear-MMSE and 
MMSE-DFE. But A-ML-DFBE is less complex than MMSE-DFE since the A-ML-DFBE requires 
smaller Lf value, and it does not require to build up the backward filter. In comparison to linear- 
MMSE, the A-ML-DFBE needs relatively even shorter forward filter and thus has lower complexity. 
The relation between the filter length and the performance can be clearly observed in the simulation 
results section. BAD requires complexity at least double of MMSE-DFE. Note that with regard to 
computational complexity, we focus on time-domain implementation even though a low-complexity 
frequency-domain implementation is also possible by making use of the block-circulant structure that 
can be created by the guard interval. In addition, note that the matrix inversion lemma can be used 
to reduce the complexity from cubic to quadratic order, but it does not affect the above conclusions. 

IV. Performance Analysis 

A. Analytical SER and BER Derivations 

In this subsection, we analyze the symbol error rate (SER) as well as the bit error rate (BER) 
performance of the A-ML-DFBE. Note that the tail detection only contains the operation of very few 
symbols, and thus, the performance is dominated by Step 2 of the A-ML-DFBE process in Table HI 
which will now be analyzed. We assume that all the decisions are accurate for analysis, which is a 
normal assumption in decision feedback theory [4]. In ([T2l ). which contains correlated noise, rjj^, the 
pre-whitening filter, = ^ , can be applied to make the variance of the noise uncorrelated 

where ^kVk with size Lf x I has a Gaussian distribution with zero mean and all components have 
unit variance. 

Since the noise now has become white Gaussian, the matched filter, ('J'^j^)^, can be employed 
and we have the following received signal equation in scalar form 

yk=CkSk+Vk, (18) 
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where y'^ = {^kh)" ^kyt' = ll^JfelP > and Vk = {"IfiJk)^ "^kVk' which is a scalar with zero 
mean and variance The SER for M-PSK constellation is given by |[T3]| 

SER^ = 1/ exp(-^)d^, (19) 

where ^psk = sin^ ^, 7^ = = = ll*fcjfclP> and M denotes the constellation 

size. The average BER for M-PSK can be written as: 

^ N-Lf+2 

BERm = _ ^ ,2 BER^, (20) 

fc=l 

where BEr|^ log^ m ^^^m Q ^'^'^ SNR and Gray mapping. Since the tail is normally short, 
which has length Lj — 2, in comparison to the whole frame length N, hence its effects can be 
neglected. Note that in time-invariant channel, SER]^_f = SER^ = . . . = SER^^ ^^^^ ^j^g 

property of J (71 = 72 = • • • = jN-Lf+2) by assuming perfect decision feedback at high SNR. 

B. Multipath Diversity Analysis 

Next, we analyze further the behavior of the proposed A-ML-DFBE at high SNR. Assuming perfect 
channel estimation at the receiver, and taking (fT9l ) as an example, it can be upper bounded by HI 

/ min(L^-l,L-l) \ 

SERi < ^exp (-^7.) - ^exp -— J] \h{t)?\ . (21) 

where 7^ ^ ^3x7X^^=0^^^ ^'^ at high SNR (Refer to Appendix I for the derivation). 

In order to obtain good performance in terms of multipath combining and inter-symbol interference 
suppression, we should choose Lj > L. Then, by averaging (|2TI ) over the Rayleigh PDF [14J, equation 
(I2T]) becomes 

which indicates that the A-ML-DFBE achieve the maximum multipath diversity order L. 



C. Analysis of the Length of the Forward Filter (Sliding Window) and Backward Filter 

It has been shown that the forward filter length, Lf, is a very important parameter in the proposed 
A-ML-DFBE. In this subsection, we discuss the behaviors of Lj: 1) Increasing the value of Lj can 
improve the robustness of ([TS] ) due to the following reasons: Firstly, as shown in (|5]l, larger value 
of Lj can incorporate more received signals as well as channel information in the forward filter; 
Secondly, indicated by ([T3] ). increasing Lj can make the Gaussian assumption more accurate; 2) 
While the performance can be enhanced, as shown in Table |II1 the complexity will correspondingly 
go up. Hence, for A-ML-DFBE, a complexity and performance tradeoff can be realized by adjusting 
Lj; 3) Performance gets converged by increasing the value of as the Gaussian assumption becomes 
accurate enough. This implies that moderate length of the forward filter can deliver good performance; 
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4) Given by Subsection-IV-B, Lf should be equal or larger than L for maximum diversity order; 5) 
The length of the backward, Lb, always equals L — 1 due to the special structure of H. 

D. Analysis of the Matched Filter in ((21) 

Note that the matched filter in ^ can obtain some additional information from the received signals 
outside the slicing window. Recalling dH)-®, can be written as 

y, = [h|^r,...,hf+^^_irf. (23) 

Although some information is lost after horizonal and vertical slicing, some gains can be still realized 
by considering the whole received signal, r. 

Supposing the matched filter is removed, the detection procedures in Table U can be used but it 
will lead to performance degradation since only the received signals inside the sliding window will 
be considered, where y^ = [r^, . . . , Vk+Lf-iV ■ As a result, the length of the forward filter has to be 
increased to make up the performance loss caused by the slicing processes in order to obtain the same 
performance. Note also if the length of the forward filter is equal to N, the A-ML-DFBE directly 
enters the 'Tail Detection' step (Step 3) in Table H which will make no difference in performance 
whether or not the matched filter is used since there is no slicing operations at all. However, the value 
of is normally much less than N . Theoretically, using the same methods as shown in Appendix 
I, it is easy to obtain the SNR for the A-ML-DFBE when the matched filter is removed. Due to the 
space limitation, we drop the detailed derivation part. But we can conclude that the performance of 
A-ML-DFBE can be upper-bounded by the same equalize without using the matched filter. 

V. Simulation Results 

In all simulations, BPSK constellation is used to generate a rate Ibps/Hz transmission. We plot 
the BER versus the signal-to-noise ratio (SNR). For analytical results, we assume perfect decision 
feedback, but for simulated results we use the feedback decisions. The performance is determined 
over doubly selective Rayleigh fading channels. The impulse response length is L = 5, and, thus, the 
length of the backward filter of the A-ML-DFBE can be fixed as L;, = L — 1 = 4. Jakes' Model is 
applied to construct time-selective Rayleigh fading channel for each subpath. The carrier frequency 
/c = 2 GHz and the symbol period Tg = 128/c, where c is the speed of light. The simulation results 
are plotted with two speeds: u = 5 vkm/h and 150 km/h (corresponding to fffTs = 0.0001 and 
0.0093, where Doppler frequency fd = vfc/c). The frame length N is 128. 

In Fig. [J and Fig. |2j we examine the analytical BER performance obtained in (l20l ) assuming that 
the channel estimation is perfect. The simulations are plotted with the vehicle speed: v = 5km /h. In 
Fig. [U we compare the analytical BER with the simulated BER. It can be observed that the analytical 



10 



BER is close and asymptotically converges to the simulated curves at high SNR. In Fig. |2j the 
analytical BER for A-ML-DFBE is plotted employing different forward filter lengths. As discussed 
earlier, the length of the forward filter, Lj, should be at least equal to L in order to realize good 
performance. From Fig. |2l we can see that the proposed A-ML-DFBE with Lf = 5 provides much 
better performance than that with Lj = 3, and as the value of increases, the performance begins 
to converge. It can be also seen that for A-ML-DFBE, Lj = 10 (two times L) is enough to obtain 
good BER performance. 

In Fig. [3l simulation results for the A-ML-DFBE detector are illustrated in comparison with 
conventional linear MMSE, MMSE-DFE, BAD, and MLSE decoders. The simulations are plotted 
with the vehicle speed: v = 5 km/h. Least square (LS) channel estimation [4] is used. From Fig. 
m it can be observed that at BER=10~'^, the performance of A-ML-DFBE with L/^ = 5 is far better 
than the linear MMSE and the MMSE-DFE equalizers. There is only 2 dB loss compared to the 
MLSE decoder at BER=10"^. At Lf = 10, there is about 0.8 dB loss compared to MLSE. Almost no 
difference can be observed for A-ML-DFBE when Lf is increased to 15 since = 10 is sufficient 
to make the performance converge. Note that when Lf = 15, A-ML-DFBE gives almost the same 
performance as Lf = 10, which demonstrates that only a small value of Lf is required to achieve good 
performance. We can also see that A-ML-DFBE with Lf = 5 can provide much better performance 
than BAD with Lf = 15. Note that our A-ML-DFBE has lower complexity than MMSE-DFE, and 
thus, lower than BAD. Clearly, from Fig. [T] to Fig. [3jwe can see that there exists a complexity and 
performance tradeoff in terms of Lf. Performance can be improved by increasing the length of the 
forward filter (slicing window). In addition, performance convergence can be also observed, which 
indicates that limited value of Lj is enough to deliver most of the performance gain. 

In Fig. m simulation comparisons are made for A-ML-DFBE without using the matched filter. 
Perfect channel estimation is assumed. Vehicle speed, v = 5km/ h, is adopted. We choose different 
Lf values for the no matched filter case: 5, 10, and 15 and L^ remains the same: 4. It is shown that 
at Lf = 5, the performance without the matched filter is worse than with it. We can also observe 
the significant performance loss due to the small value of Lj. It is shown that Lf must be 15 for 
the system with no matched filter to provide the same performance as the matched filter system with 
Lf = 10. Hence, from the simulation results we can see that the matched filter is very important 
for system performance. Note that as discussed in the complexity analysis part. Subsection III-B, 
the forward and backward filter taps are actually fixed and can be obtained before the A-ML-DFBE 
detection. The complexity increase by the use of the matched filter is much more worthwhile than to 
increase the length of the forward filter without using the matched filter. 

In Fig. [51 simulation results for the A-ML-DFE detector are illustrated in comparison with conven- 
tional linear MMSE, MMSE-DFE, BAD, and MLSE decoders using LS channel estimation and the 
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vehicle speed is v = 150km /h. Here, we choose different values for Lf for A-ML-DFBE. From the 
simulation results, we can still observe that the performance of A-ML-DFBE converged at Lf = 10, 
and no gain can be obtained at Lf = 15. Due to the time-variant effects, the performance is degraded 
compared to the results in Fig. |3l We can see about 1 dB loss between MLSE and A-ML-DFBE with 
Lf = 10 when BER=10^^. However, the proposed equalizer can still substantially outperform Linear 
MMSE and MMSE-DFE in all SNR regime. Around 8 dB performance gain can be obtained by the 
proposed scheme with Lf = 5 compared to the BAD at BER=10^'^. 

VL Conclusions 

In this paper, we have proposed a simple approximate ML decision feedback equalizer for doubly 
selective fading environment. From the analytical and simulation results, we conclude that the A-ML- 
DFBE significantly outperforms the linear MMSE, MMSE-DFE, and BAD detectors, and provides 
performance very close to MLSE. We have shown that when is large enough, further increases 
in L J do not improve performance much. This implies that the proposed equalizer is quite robust 
against ISI. A tradeoff in terms of the complexity and the performance can be achieved by adjusting the 
value of Lf. Computational complexity comparison has demonstrated that the A-ML-DFBE requires 
fewer additions and multiplications than MMSE based schemes. In addition, the implementation of 
the matched filter is very important and the A-ML-DFBE obtains maximum diversity order when 
Lf>L. 

Due to the DEE processing, parallel computing is difficult to achieve for the proposed equalizer. 
However, by adjusting the size of the data block or the filters (back and forward), or both, the latency 
can be reduced. The proposed equalizer can be easily used for radar communication systems as 
it is suitable to solve time-domain equalization problems. In current wireless systems like UMTS, 
HSDR\ or HSUPA, the A-ML-DFBE can be used to recover signals similar to MMSE or MMSE- 
DFE. For LTE or LTE advance, the proposed algorithm can be extended to realize frequency-domain 
equalizations. 

Appendix A 

Derivation of Closed-Form Expression of 7^ at High SNR 

Now, the closed-form expression of 7^ at high SNR is derived in terms of Lf and L. From 
Subsection IV-A, 7^, can be written as 

7fe = ll^yfef =jfA~%=jf (a2x + YY^)-'j„ (24) 

where for convenience Y = J^+i has size Lf X Lf- 1, and X = J^,. By using the Kailath Variant 
(A + BC)"^ = A"^ - A"^B(I + CA"^B)-iCA-^ UM, the inversion term on the right side of (El 
can be further written as 
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7fc = a^^x-i _ cj-^x-iy {a^Uj-i + Y^X^^Y) Y^X^^ (25) 

At high SNR, as o"^ — )• 0^, the effect of (T^Il^._i is comparatively small, which can be ignored 
from an asymptotic point of view. Hence, we have the following approximation for the second term 
in ^ 

^-2x-1y(y^X-^Y)"^Y^X-i = CT^^X^^Z {Z^zy^ Z^X-2. (26) 

where Z = X with size Lfx{Lf — 1), and represents the unique positive definite Hermitian 
root ifBl . 

Let Z+ be the Moore-Penrose inverse of matrix Z, and Z+ = (Z^Z)^^Z^ of size (Lj — 1) x 
Lj. Note that rank(ZZ"'') = rank(Y) = — 1 and ZZ+ has size Lj x Lj. By eigenvalue 
decomposition, we can get ZZ+ = UIIU^ where U is the unitary eigenvector matrix and 11 = 
diag{Ai, . . . , Al^_i, 0}. From the definition of ZZ+, we have (ZZ+)2 = ZZ+. Therefore, ZZ+ is 
idempotent ifTSl . and any idempotent matrix has eigenvalue 1 or 0, and thus 11 = diagjl, ...,1,0}. 
We can then get 

j^X-iZ(Z^Z)-'z^X-ij, =j|^X-iunU^X-ij, « ^^lflj^X~%. (27) 



From dill), (125]|, ^B, and ([271), at high SNR, we can obtain 

1 



Ik « -^j^X-\, (28) 



From we can get 

i^X-% = hf Hfc(H|^Hfc)-iHf h,, (29) 

where = H^^^h^ and X = H^Hk has size Lf x Lf and rank Lf. Since Hki^k^k)'^^;^ 
same structure as Z (Z^Z) ^ Z^ in (l27l) . we can get the corresponding eigenvalues as 

EIG(Hfe(H|^Hfe)-iH|^) =diag{0^_^,l^_^, 0^_^ }. (30) 

k-l Lf N+L-Lf-k 

Finally, combining ([281) and ([291), at high SNR as cj^ 0+, finally we have 

min(L/— 1,L— 1) 



7fc 



E (31) 



4 = 
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TABLE I 

Approximate Maximum Likelihood Decision Feedback Block Equalization Algorithm. 



Step 1: Initialization of A-ML-DFBE 

L Set Lf (Lf > L). 
2. Fix Lb ~ L — 1. 
Step 2: A-ML-DFBE Detection 

¥or k = l: N -Lf 

\. Forward Process: Apply the temporal sub matched filter according to 

2. Decision Feedback Process: Remove the effects reconstructed by the past 

decisions using ( fTOl i. 

3. Approximate ML: Recover the transmitted signals with (flSl l. 

end; 

Step 3: Tail Detection 

L Fix the sliding window: 

2. Signal Recovery: 

¥or k ^ N - Lf + I : N 

1. Decision Feedback Process: Remove the effects of the past decisions 

using ( [Tol l. 

2. Approximate ML: Recover the transmitted signals with (flSl l. 

end; 
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TABLE n 

Computational complexity of various schemes for one sliding window with length Af; L is the number 

OF PATHS; BPSK CONSTELLATIONS; L/ AND STAND FOR THE LENGTH OF THE FORWARD AND BACKWARD FILTERS, 

RESPECTIVELY. 



Detector 


Additions 


Multiplications 


A-KL-JF3E 


iV[8Z.;' + :i ILj + (()L + 7}Lj + (:iL - 1); 


N\(2Ll - ntf ) - (12L + 19)L, -r 18] 


Linear-MMSE 


NlSLf + 30Lf + 2(3L + 2)L/] 


N[2I^ + A2Lj - (12L - 17)Lf - (6L - 1)] 


MMSE-DFE 


N[8{Lf + lI) + A2(Lf + Ll) + 2(3L + 2){Lf + U)] 


N[2{Lj + Lf ) + 42(L? + L?) + (12L - l\){Lf + Lb) + 6] 


BAD 


N[lG{Lj + Ll) + 84(L; + Ll) + 4(3L + 2)(Lf + L(,)] 


Af[4(L; + Ll) + 84(L; + Ll) + 2(12L - 11)(Z^ + Lj,) + 12] 
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Fig. 1. Analytical BER performance of the A-ML-DFBE over a doubly selective fading channel with perfect channel 
estimation (L = 5, fdTs = 0.0001) and simulated BER. 
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Fig. 2. Analytical BER performance of the A-ML-DFBE with various forward filter length over a doubly selective fading 
channel with perfect channel estimation (L = 5, fdTs = 0.0001). 
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. 3. Simulated BER performance of the A-ML-DFBE over a doubly selective fading channel with LS channel estimation 
= 5, fdTs = 0.0001). Shown for comparisons ai'e the Lineai' MMSE, MMSE-DFE, BAD, and MLSE. 
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Fig. 4. Simulated BER performance of the A-ML-DFBE over a doubly selective fading channel with perfect channel 
estimation (L = 5, fdTs — 0.0001). Shown for comparisons are A-ML-DFBE with and without matched filter (MF). 
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Fig. 5. Performance of the A-ML-DFBE over a doubly selective fading channel with LS channel estimation {L = 5, 
fdTs = 0.0093). Shown for comparisons are the Linear MMSE, MMSE-DFE, BAD, and MLSE. 



